# Chinese optimization
Baidu ERNIE 4.5 0.3B PT GGUF
Apache-2.0
A quantized version of Baidu's ERNIE-4.5-0.3B-PT model, produced with llama.cpp to reduce model size and improve inference efficiency.
Large Language Model Supports Multiple Languages
bartowski
314
3
Skywork Skywork SWE 32B GGUF
Apache-2.0
Skywork-SWE-32B is a large language model with 32B parameters. This version is quantized with llama.cpp imatrix and can run efficiently in resource-constrained environments.
Large Language Model
bartowski
921
2
Qwen3 30B A3B Gptq 8bit
Apache-2.0
Qwen3 30B A3B is a large language model quantized to 8 bits with the GPTQ method, suitable for efficient inference.
Large Language Model
Transformers

btbtyler09
301
2
Smoothie Qwen3 4B
Apache-2.0
Smoothie Qwen is a lightweight adjustment tool that smooths token probabilities in Qwen and similar models to produce more balanced multilingual generation.
Large Language Model
Transformers English

dnotitia
2,189
2
React Native Executorch Qwen 3
Apache-2.0
Qwen 3 is a language model packaged for the ExecuTorch runtime, offered in both quantized and non-quantized versions at different scales.
Large Language Model
software-mansion
732
1
Qwq DeepSeek R1 SkyT1 Flash Lightest 32B
This is a merged model based on Qwen2.5-32B, incorporating features from DeepSeek-R1-Distill-Qwen-32B, QwQ-32B, and Sky-T1-32B-Flash to enhance performance.
Large Language Model
Transformers

sm54
14
4
Qwen2.5 14B YOYO V2
Qwen2.5-14B-YOYO-V5 is an enhanced version based on the Qwen2.5-14B foundation model, created by merging multiple pre-trained language models.
Large Language Model
Transformers

YOYO-AI
14
2
Qwen2.5 VL 7B Instruct GPTQ Int4
Apache-2.0
Qwen2.5-VL-7B-Instruct-GPTQ-Int4 is an unofficial GPTQ Int4 quantized version of the Qwen2.5-VL-7B-Instruct model, supporting multimodal image-text-to-text tasks.
Image-to-Text
Transformers Supports Multiple Languages

hfl
872
3
Qwen2 VL 7B Instruct GGUF
Apache-2.0
Qwen2-VL-7B-Instruct is a multimodal vision-language model that jointly understands images and text and generates text responses.
Image-to-Text
Transformers English

tensorblock
124
0
Qwen2 VL 2B Instruct GGUF
Apache-2.0
A GGUF-format quantized version of the Qwen2-VL-2B-Instruct vision-language model, suitable for use with llama.cpp.
Image-to-Text
Transformers English

tensorblock
107
0
Moxin 7B LLM
Apache-2.0
Moxin 7B is an open-source large language model available in base and chat variants, with good reported performance on multiple common datasets.
Large Language Model
Transformers

moxin-org
134
17
Glm Edge V 5b
Other
GLM-Edge-V-5B is a 5-billion-parameter multimodal model that accepts image and text inputs and performs image understanding and text generation tasks.
Image-to-Text
THUDM
4,357
12
Skywork Critic Llama 3.1 8B
Other
The Skywork Critic series consists of advanced judge models that excel at pairwise preference evaluation. They compare a pair of inputs and provide detailed judgments.
Large Language Model
PyTorch
Skywork
1,376
12
Qwen2
Other
Qwen2 is the Tongyi Qianwen series of large language models, spanning parameter scales from 500 million to 72 billion and including instruction-tuned variants.
Large Language Model
cortexso
132
1
Qwen2 7B Int4 Inc
Apache-2.0
An INT4 quantized model based on Qwen2-7B, generated with Intel's auto-round tool and suitable for efficient inference.
Large Language Model
Transformers

Intel
48
6
Yi 1.5 9B
Apache-2.0
Yi-1.5 is an upgraded version of the Yi model, excelling in programming, mathematics, reasoning, and instruction-following capabilities while maintaining excellent language understanding, commonsense reasoning, and reading comprehension.
Large Language Model
Transformers

01-ai
6,140
48
Meditron 7b Llm Radiology
Apache-2.0
An open-source model released under the Apache-2.0 license. No further details are provided.
Large Language Model
Transformers

nitinaggarwal12
26
1
DNABERT S
Apache-2.0
An open-source model released under the Apache-2.0 license. Refer to the model documentation for specific functionality.
Large Language Model
Transformers

zhihan1996
2,815
7
Chattruth 7B
ChatTruth-7B is a multilingual vision-language model built on the Qwen-VL architecture, with added high-resolution image processing and a restoration module that reduces computational overhead.
Image-to-Text
Transformers Supports Multiple Languages

mingdali
73
13
Tess M V1.3
Other
Tess-M-v1.3 is a general-purpose large language model built on Yi-34B-200K, with ultra-long context processing capabilities.
Large Language Model
Transformers

migtissera
99
26
Chinese Llama 2 7b Gguf
Apache-2.0
GGUF v3 files of the Chinese LLaMA-2-7B model, for use with llama.cpp.
Large Language Model
Transformers Supports Multiple Languages

hfl
254
5
Chinese Llama 2 13b 16k
Apache-2.0
The full Chinese LLaMA-2-13B-16K model, which supports a 16K context length and can be loaded directly for inference and full-parameter training.
Large Language Model
Transformers Supports Multiple Languages

hfl
10.62k
14
Pai Diffusion General Large Zh
Apache-2.0
An open-source Chinese latent diffusion model from the Alibaba PAI team, supporting text-to-image generation from Chinese prompts.
Image Generation
alibaba-pai
15
2
Elasticbert Base
ElasticBERT is an efficient multi-exit BERT model that supports dynamic adjustment of computational resources.
Large Language Model
Transformers English

fnlp
252
3
Randeng MegatronT5 770M
Apache-2.0
A Chinese version of the T5-large model, specialized in natural language conversion tasks.
Machine Translation
Transformers Chinese

IDEA-CCNL
16
7